Noise robust speech recognition using spectral subtraction and F0 information extracted by Hough transform
نویسندگان
چکیده
We propose a noise robust speech recognition method based on combining novel features extracted from fundamental frequency (F0) information and spectral subtraction. F0 features have been shown to be effective in speech recognition in noisy environments. Recently, F0 features obtained by Hough transform were developed for concatenated digit recognition and significantly improved recognition performance of noisy speech. This paper proposes novel features based on Hough transform for large-vocabulary continuous speech recognition. In addition, spectral subtraction is applied before Hough transform to remove static noise. The proposed method was tested using the Japanese Newspaper Article Sentences (JNAS) database. Word accuracy was improved in all noise conditions, with the best absolute improvement being 2.6 points in percentage when station noise was added at 10 dB SNR.
منابع مشابه
Noise robust speech recognition using F0 contour extracted by hough transform
This paper proposes a noise robust speech recognition method using prosodic information. In Japanese, fundamental frequency (F0) contour represents phrase intonation and word accent information. Consequently, it conveys information about prosodic phrase and word boundaries. This paper first proposes a noise robust F0 extraction method using Hough transform, which achieves high extraction rates ...
متن کاملNoise Robust Speech Recognitio Extracted by Hough Tr
This paper proposes a noise robust speech recognition method using prosodic information. In Japanese, fundamental frequency (F0) contour represents phrase intonation and word accent information. Consequently, it conveys information about prosodic phrase and word boundaries. This paper first proposes a noise robust F0 extraction method using Hough transform, which achieves high extraction rates ...
متن کاملNoise Robust Speech Recognition Using Prosodic Information
This paper proposes a noise robust speech recognition method for Japanese utterances using prosodic information. In Japanese, the fundamental frequency (F0) contour conveys phrase intonation and word accent information. Consequently, it also conveys information about prosodic phrase and word boundaries. This paper first proposes a noise robust F0 extraction method using the Hough transform, whi...
متن کاملImproving the performance of MFCC for Persian robust speech recognition
The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...
متن کاملNoise-robust speaker verification using F0 features
This paper proposes a noise-robust speaker verification method augmented by fundamental frequency (F0). The paper first describes a noise-robust F0 extraction method using the Hough transform. Then, it proposes a robust speaker verification method using multi-stream HMMs which fuse the extracted F0 and cepstral features. Experiments are conducted using fourconnected-digit utterances of Japanese...
متن کامل